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Abstract 



We discuss two different ways of chromosomes' and genomes' evo- 
lution. Purifying selection dominates in large panmictic populations, 
where Mendelian law of independent gene assortment is valid. If the 
^^ populations are small, recombination processes are not effective enough 

to ensure an independent assortment of linked genes and larger clusters 

^ of genes could be inherited as the genetic units. There are whole clus- 

^ ters of genes which tend to complement in such conditions instead of 

r — single pairs of alleles like in the case of purifying selection. Computer 

'-'i simulations have shown that switching in-between complementation 

^^ and purification strategies has a character of a phase transition. This 

is also responsible for specific distribution of recombination events ob- 
served along eukaryotic chromosomes - higher recombination rate is 
observed in subtelomeric regions than in central parts of chromosomes 
- for sympatric speciation and probably for non-monotonous relation 
between reproduction potential and genetic distance between parents. 
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1 Introduction 

Contemporary, common knowledge of genetics is based mainly on Mendelian 
laws of inheritance integrated with Morgan chromosomal theory. Mendel 



concluded, from the results of his experiments, that there should be some 
hereditary units (he called them "factors") which are responsible for trans- 
ferring some traits from parents to their offspring. These factors can be 
dominant, like that one responsible for purple colour of pea flowers or re- 
cessive, like that one responsible for white flowers of pea. The last one is 
hidden in the plants which possess both types of factors. In the normal so- 
matic cell (diploid) two corresponding factors exist for each trait but they 
are separated when a haploid gamete is produced during meiosis. Mendel's 
laws describe such a separation of factors belonging to one pair responsible 
for a given trait or belonging to more pairs responsible for different traits. 
The first Mendel law - Law of Segregation or purity of gametes - says that 
each gamete receives only one factor of a pair (one allele of a gene). The 
second Mendel law - Law of Independent Assortment - says that alleles of 
different genes assort independently of each other during gamete formation. 
The Mendel Laws were described in 1865 [TT\ and rediscovered by de Vries 
in 1900 03]. A few years later they have been supported by Boveri [2] 
and Sutton |l0] who suggested that chromosomes can carry the hereditary 
traits intimated by Mendel. But it was T. Morgan who, after many experi- 
ments performed with D. melanogaster, supplied the clear evidence that the 
chromosomes are the real vehicles of hereditary information. He published 
with his colleagues in 1915 a paper Mechanism of Mendelian Heredity |25) . 
They stated that Mendelian factors are like beads located on chromosomes 
- pairs of the corresponding beads on the pairs of homologous chromosomes. 
They segregate with chromosomes into germ cells and again combine in zy- 
gotes, exactly as predicted by Mendel. They noticed that some traits are sex 
linked which means that these traits are determined by "beads" located on 
the chromosome which determines gender. Furthermore, they noticed that 
some traits can be linked and tend to transfer together to the gametes just 
because they probably are located close to each other on the same chromo- 
some. But sometimes those connections are broken and they can assort to 
the gametes independently producing a new combination of traits. It is the 
last notice which triggers the common conviction that all alleles are finally 
assorted independently into the gametes. In fact, it could be true only in 
very large populations sharing the common genetic pool, called Mendelian 
population. It is not a case in natural populations with spatially distributed 
individuals, where genetic relations (or distances) between individuals have 
to be taken under account. Let's consider a situation presented in FigjT] 
describing the classic Mendel's laws. Each parental individual (P) possesses 
two pairs of alleles located on different chromosomes, the first parent has all 
alleles dominant: ^||^, B\\B and the second one - all alleles recessive: a\\a 
and b\\b. Since each of these individuals can produce only one type of ga- 
metes: AB or ab, in the first generation (-F1) all individuals have genotypes 
A\\a and B\\b and pheno types determined by the dominant alleles. Each 
of these Fl individuals can produce four types of gametes: AB, Ab, aB, 



and ab. Assuming independent and random assortment of chromosomes, the 
fraction of each type of gametes is 0.25. That determines the ratios between 
genotypes of the F2 generation, assuming no gamete preselection. The geno- 
types shown in the table determine the phenotypes of individuals with ratios: 
9/16 individuals with both traits determined by dominant alleles, 3/16 for 
each of two phenotypes with one dominant and recessive trait and 1/16 for 
the phenotype with both recessive traits. 
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Figure 1 : Scheme of inheritance of two pairs of alleles in case when they assort 
into gametes independently and quantitative relations between genotypes in 
the second generation. 



Now, let's analyze the situation where the considered pair of genes is 
located on the same chromosome close to each other: Ai?||yli? and a6||a6. 
The parental individuals can produce gametes with the same genetic contents 
as in the previous case. All Fl individuals are also yli?||a6. Nevertheless 
these Fl individuals are going to produce only AB and ab gametes and the 
F2 generation will be much less genetically and phenotypically diversified, 
only Ai?||^i?, ^i?||a5, aa||^i? and a6||a6 with a corresponding phenotypic 
ratio: 3:1. Let's assume that A and B denote functional alleles of genes 
responsible for important functions of an organisms while a and b are their 
corresponding non- functional, deleterious alleles. Any organism with a\\a 
and/or b\\b genetic configuration has to suffer from genetic disease(s). It 
is obvious that the two above situations are quite different. In the case of 
independent assortment Figjllthe fraction of 7/16 of F2 would be suffering 
while in the second case Fig J2] only 1/4 is going to suffer. It is obvious that 
selection would favour some genetic configurations. Paradoxically, there are 
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Figure 2: Scheme of inheritance of two pairs of alleles in case when they 
are linked and assort into gametes as a cluster and quantitative relations 
between genotypes in the second generation. 



no recombinations between alleles in the second case though recombinations 
suppose to ameliorate the genetic conditions of genomes. 

A situation where more linked and complementing alleles occur in the 
heterozygous state in the parental genomes is shown in Figj3] "Complement- 
ing" means that dominant alleles hide (complement) the effect of their poten- 
tially deleterious, recessive counterparts. All considered loci in the parental 
genomes are occupied by pairs of different alleles - one recessive, another one 
dominant. Thus, the individuals have all phenotypic traits determined by 
the dominant alleles. Since in the described case we have assumed that all 
alleles are closely linked thus, there are no recombinations inside the group 
of genes (called cluster of genes), and only two types of gametes are pro- 
duced. Assuming that these gametes combine randomly to form zygotes, 
half of the produced zygotes would be composed of complementing gametes 
while the other half would be composed of identical gametes (shadowed in 
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Figure 3: Scheme of inheritance of complementing gene cluster. Notice that 
without recombination, if gametes fuse randomly, half of individuals in the 
next generation possess complementing haplotypes while half of them have 
pairs of identical haplotypes, they express many deleterious phenotypic traits 
(grey boxes). 



the figure). In the last cases all loci would be homozygous and some of 
them would determine recessive traits. If there is a very low recombination 
probability inside the cluster, then we have a situation described in Figj4] 
One individual produces the recombined gametes (the right two gametes). 
These gametes cannot "find" a fully complement gamete in the pool of non- 
recombined gametes produced by the rest of individuals in the population 
(the left two gametes). Thus, a gamete with the recombined cluster produces 
a zygote with some recessive characters which could die even before birth 
(zygotic deaths). 

Let's stay with the example described in Figj4] Both parental genomes 
have identical configurations of their haplotypes (clusters of genes) and re- 
combination inside the cluster is low but possible so, it can happen with a 
finite frequency during each gamete production. What could happen with 
their genomes after many generations? If there is no selection for any spe- 
cific genome configuration, mating is random, the number of generations 
is very high and population is very large, then one can expect that many 
recombinations can happen inside the clusters and the original allele config- 
urations would be disrupted. In such a scenario alleles seems to be randomly 
assorted into gametes and the Hardy- Weinberg principle (HWP) is in force 
|13] |47j . The fractions of any allele combination can be calculated just from 
the frequencies of alleles in the genetic pool. Nevertheless, natural popula- 
tions do not fulfil the conditions required for such a "Mendelian population". 
Especially, populations sharing a common genetic pool are usually much 
smaller than the whole population of the species, there are some mating 
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Figure 4: Scheme of inheritance of complementing gene cluster with one re- 
combination event during gamete production in one parental genome. Notice 
that recombination in the second (left) parent produces gametes which can 
not form fully complementing zygotes with non-recombined gametes. Thus, 
all individuals in the next generation possess pairs of recessive alleles and 
express many deleterious phenotypic traits (all boxes are grey). 

preferences, subpopulations could be separated by physical or geographical 
barriers, selection favours or discriminates some genetic configurations and, 
what is the most important, all these conditions are superimposed on the re- 
stricted and unevenly distributed recombination events along the eukaryotic 
chromosomes. Thus, emerging of gene clusters should be possible and com- 
plementation of whole clusters in genomes should be also observable. In the 
next chapters we present computer models describing the conditions where 
complementation of large gene clusters is generated showing a possible way 
of eukaryotic chromosomes' evolution. 



2 Modelling the evolution of chromosomes 

2.1 Populations without age structure 

Computer modelling of the population evolution were performed according 
to the Monte Carlo method described previously |49) |50) . Virtual popu- 
lations are composed of individuals represented by their diploid genomes. 
Genomes are composed of pairs of bitstrings (homologous chromosomes), 
each L bits long. A bit set to corresponds to the wild (functional) form of 
an allele while a bit set to 1 corresponds to the defective allele. It has been 
assumed that all defective alleles are recessive which means that two alleles 
at corresponding loci in both chromosomes (the same positions in bitstrings) 
have to be defective to determine deleterious phenotype. To reproduce, an 



individual seeks for a partner. Each one of two partners produces one gamete 
and the fusion of the gametes gives a newborn. Gametes are produced by 
rephcation of two parental bitstrings, introducing M mutations into random 
positions of each new copy, and recombination between copies in the corre- 
sponding but random positions mimicking crossover. Mutation changes bits 
to 1, if a bit chosen for mutation is already set to 1 it stays one - there 
are no reversions. Frequency of mutations M and recombinations C are very 
important parameters of the model. After each Monte Carlo step (MCs) 
2% of individuals are randomly eliminated from the population. This gap 
is filled up in the next MCs by surviving newborns. Newborns with at least 
one position determining a deleterious phenotype (both bits at corresponding 
positions set to 1) will never reproduce and are dying. 

2.2 Age structured populations 

To simulate the evolution of the age structured populations we have used 
diploid version of the Penna model |34] . The Penna model differs from that 
one described above by consecutively expressing (switching on) the bits lo- 
cated on the bitstrings - in the first MCs after reproduction, a newborn 
expresses the first pair of bits, in the seconds MCs - the first two pairs and 
so on. Thus, the age of individuals corresponds to the number of switched 
on bits. If the maximum number of the expressed deleterious phenotype 
traits (threshold T) is reached, this individual is killed because of its genetic 
status. In such populations the maximum age equals L, but usually, after 
long enough simulations all individuals eventually die because of their ge- 
netic death before they reach the age L and the age structures of simulated 
populations correspond very well to natural diploid populations. Additional 
parameters in this model are: minimum reproduction age R - individuals 
which reach this age can reproduce, and B - number of newborns which 
can be produced by one female during each MCs. The gender of each new- 
born is determined - female or male - with equal probability. In panmictic 
populations, a female at reproduction age looks for a male partner in the 
reproduction age in the whole population to produce an offspring which is 
born with probability described by Verhulst factor: V = .7 * ; where Nt is 
an actual number of individuals in the population while Nmax is maximum 
size of population. There are no random deaths of individuals after birth 
|23) . Each female being at reproduction age is chosen for reproduction only 
once during one MCs, while the male after reproduction is going back to the 
pool of males and again can be chosen by another female for reproduction. 
Such a population is called panmictic. To impose some restriction on pan- 
mictic reproduction it is possible to divide the whole population for smaller, 
non-overlapping subpopulations or to introduce a spatial distribution of in- 
dividuals determining distances between sexual partners and their offspring, 
with partners selected from the neighbourhood and birth requiring a free 



nearby place. 

2.3 Spatially distributed populations 

To study the role of spatial distribution of the large populations, individuals 
were distributed on a square lattice - at most one individual in one square. To 
avoid the border effect, the lattice was wrapped on a torus. To reproduce, 
the female chooses a partner located in a distance not larger than P and 
puts an offspring at distance not larger than O if there is any free square at 
such a distance. The role of spatial distribution was studied in both models 
described above - without age structure, introducing random elimination of 
declared fraction of population every MCs, or age structured populations, 
where individuals aged and were eliminated by genetic death only. 

3 Results and discussion 

3.1 Emerging complementarity; transition between comple- 
menting and purifying selection strategies 

Complementing groups of genes are described in the introduction and shown 
in Fig|3] The easiest way to show the emergence of complementation in 
computer simulations is to declare that there are no recombinations between 
bitstrings and that heterozygous loci increase the fitness of individuals, like in 
case of sickle cell anaemia trait |37| . cystic fibrosis [lOj or Tay-Sacks syndrome 
in humans ^. Under such condition (L = 100, iV = 1000, M = 1,C = 0), 
in the first version of the model, genomes reach full heterozygosity (all loci 
are heterozygous) after about 10^ MCs |49j. One recombination event be- 
tween bitstrings during each gamete production prevents complementarity 
to emerge. In the fully heterozygous genome, the frequency of defective 
alleles is 0.5. Since in the model, one homozygous locus with both alleles 
defective kills the individual, assuming that defective alleles are distributed 
randomly along the bitstring, the probability of forming a surviving, diploid 
genome without any locus with both defective alleles (homozygous) is of the 
order of 10~^^. Populations under such conditions should be extinct, but 
they survive. Surviving is possible because the distribution of defects along 
the chromosomes is not random. Selection leads to the state where mainly 
two different types of bitstrings are present in the whole genetic pool. If 
two identical bitstrings form a newborn's genome - it is eliminated because 
many homozygous loci determining deleterious phenotypes appear in such 
a genome. If two different (complementing) bitstrings form a genome - it 
survives and it is the fittest because it has the maximum number of het- 
erozygous loci. Thus, the probability of forming the surviving genome is 
not 10~^^ but 0.5 in case when only two complementing haplotypes exist in 
the genetic pool. Recombinations disrupt the complementing haplotypes, as 
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Figure 5: Fractions of defective alleles in the diploid genomes with 1 crossover 
between homologous chromosomes and without recombinations. Number of 
alleles in one chromosome (bitstring) - 100. 



presented in Figj4] Further simulations have shown that it is not necessary 
to declare the advantage of heterozygosity to produce complementing haplo- 
types (FigJ5|. Simulations performed without recombinations between chro- 
mosomes during gamete production lead to full complementarity between 
haplotypes, while inclusion of recombinations between them prevents such 
a complementarity to emerge and purifying selection prevails. The question 
is - what is the character of the transition between the two strategies of 
the genome evolution under an intermediate recombination rate? Simula- 
tions performed under the same parameters: L = 100, M = 1,A^ = 1000 
with only one free parameter - recombination rate - have shown that popula- 
tions choose between the two strategies around the recombination rate 0.14 
(FigJGj). For the above parameters, below this frequency of recombination, 
complementation prevails while above this value, the strategy of purifying 
selection is winning. The range of recombination rate where the transition 
between these strategies occurs is very narrow - of the order of 0.01 |50| . 
The recombination rate at which the transition between the two strategies 
of evolution is observed has been called critical recombination rate - C* . At 
the critical recombination rate, the probability of forming a surviving zygote 
from two gametes is the lowest (see FigJTJ). 
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Figure 6: Transition between complementation and purifying selection. Be- 
low recombination rate 0.14 high fraction of defective alleles is observed and 
complementation prevails. At recombination rate 0.14 — 0.15 (critical recom 
bination rate C*) sharp transition for purifying selection is observed. 
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Figure 7: Relation between the probability of forming a surviving zygote 
and recombination rate. Y-axis is scaled in the average number of offspring 
which have to be formed to get one survivor. 



3.2 Relation between the critical recombination rate and 
population size 

There is a power law relation between the value of critical recombination 
rate C* and the population size (Figjsl notice the logarithmic scale of both 
axes). Larger panmictic populations have a lower critical value C* [50]. The 
explanation of this phenomenon seems to be trivial; in small populations the 
average genetic relation between individuals is high and the total number of 
crossover events in a given chromosome which could happen in the lineages 
of sexual partners to their common ancestor is relatively low - the inbreeding 
coefficient is high. In the larger populations this number is higher and the 
disruption of gene clusters is more probable. 



3.3 Transition between complementarity and purifying se- 
lection strategies in spatially distributed populations 

One can say that transition between the purifying and complementing strate- 
gies of evolution has not the character of phase transition because it depends 
on population size. In fact the transition directly depends on the probability 
(frequency) of recombination in the whole genetic linage separating sexual 
partners. It is possible to show such relations in simulations performed on 
a square lattice. If a female can choose a partner at distance no larger than 
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Figure 8: Relation between critical recombination rate and effective popula- 
tion size. Notice the logarithmic scale of both axes. 



P and she can locate her baby at distance no larger than O then the effec- 
tive population size and inbreeding coefficient is determined by these two 
parameters: P and O, independently of the population size (dimensions of 
the lattice). Results of simulations (FigJ9| suggest that under all other pa- 
rameters fixed, C* value does not depend on population size |44| |46) . For 
increasing P and/or O, C* decreases and populations enter the purifying 
strategy at lower recombination rate (results not shown). 



3.4 Zygotic death close to the critical recombination rate 
and populations' extinction 

In the versions of model where population size was controlled by Verhulst 
factor instead of being kept constant, it has been observed that population 
size depends non-monotonically on recombination rate. Under low and high 
recombination rate populations are larger while in the intermediate recombi- 
nation rate they are smaller or could be even extinct. This effect is connected 
with higher zygotic death close to the critical recombination rate (Figj7| |50) 
|46) . Zygotic death in this case means, that fusion of two gametes produces 
a dying offspring. If the birthrate is too low to compensate zygotic death 
the population is extinct |39) . The same effect was observed in populations 
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Figure 9: Transition between complementing strategy and purifying selection 
in simulations on square lattice of different sizes (see legend in upper right 
corner). More detailed information in the text. 



expanding on square lattices (Fig, 10) 



3.5 Relations between chromosome size, coding ciensity anci 
critical recombination frequency 

The size of natural chromosomes is measured by the number base pairs, by 
the number of genes located on them or by the genetic units which correspond 
to the frequency of recombination. One unit (centiMorgan - cM) corresponds 
to the distance between two markers where the probability of recombination 
during one meiosis is 0.01. The smallest human chromosome (excluding Y 
chromosome) contains 280 genes while the largest one about 2200 genes. 
On the other hand genetic length of the shortest one is 62cM while of the 
largest chromosome is 278cM (for details see International HapMap Con- 
sortium 2005 and [22]). Since the correlation between coding capacity and 
recombination rate is not very high in the human genome (Fig]ll[) - our nat- 
ural chromosomes may differ significantly in relations between recombination 
rate and number of genes. The most densely packed human chromosome (19) 
possesses almost 14 genes per cM while the least packed one (18) has only 
2.5 genes per cM on average. What is more interesting, these two chromo- 
somes have almost the same length measured in cM. In computer modelling 
usually the number of genes (bits) is considered as a parameter describing a 
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Figure 10: Expansion of populations on lattice depending on recombination 
rate. Blue/red scale shows the number of individuals on lattice after given 
time of simulations measured in Monte Carlo steps. 
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size of chromosome. Recombination is considered as an independent param- 
eter of simulations. In fact there is a very strong relation between these two 
parameters. In the previously described results it has been shown that chro- 
mosomes containing the same number of genes at recombination rate below 
the C* value enter the complementing strategy while above the C* value - the 
purifying selection. That means that for higher coding density per recombi- 
nation unit complementation should be expected [50j [22j . The relations are 
much more complicated. Generally, there is a power low relation between the 
C* value and the effective population size for any chromosome size (Fig]12[ 
notice log /log scales). Nevertheless, increases of C* values in the log /log 
plots are observed for growing numbers of genes per chromosome - chromo- 
somes with higher content of genes can enter the complementation strategy 
under higher recombination rate. Though, the relation is not linear. In 



Fig, 13 the relations between C* and the number of genes per chromosomes 
are shown for different population size. At the same figure, the data for 
real human chromosome are plotted (circles). These data suggest that most 
of human chromosomes could enter complementing strategy if the effective 
population size is in-between 100 and 300. Chromosome 22 could enter the 
complementing strategy even if effective population size is of the order of 
500. Computer simulations of evolution of genomes consisting of two pairs 
of chromosomes differing in the number of genes have shown that one pair 
of chromosomes can be under the complementing strategy while the other 
pair under purifying selection. These results were obtained for populations 
without age structure thus; the effective population size in the modelled 
populations should be multiplied by the factor of three to correspond the 
natural effective populations since only about 1/3 of the population is in re- 
productive age. The same fraction of individuals being in reproduction age 
is observed in human populations ^4] . Genetic analyses suggest that humans 
evolved in the effective populations of this order |41| |21] . 

3.6 Complementarity in polymorphic loci 

In all simulations described above only two forms of alleles were considered: 
functional one - dominant and defective one - recessive. A homozygous lo- 
cus with both alleles recessive determined the deleterious phenotypic trait. 
In the nature, relations between alleles are more complicated that is why 
some simulations have been performed with polymorphic loci. Each gene 
composed of 8 bits could be represented by 256 different alleles, with their 
adaptive values from to 8 depending on the number of bits set to 1. Thus, 
the sum of two alleles values occupying the same locus in the homologous 
chromosomes could vary from to 16 and the environment could require a 
declared value of a given locus. This demand could be fulfilled by differ- 
ent pairs of "complementing" alleles. In such simulations, the emerging of 
complementarity was also observed [5]. 
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Figure 11: Relation between the coding capacity of human chromosomes 
measured in number of genes (x-axis) and average number of crossover events 
during one meiosis (y-axis). 
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Figure 12: Relations between critical recombination rate and effective popu- 
lation sizes for different chromosomes' lengths (number of bits in the bitstring 
in the right corner legend). Notice double logarithmic scale. 
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Figure 13: Relations between critical recombination rate and chromosome 
length for different effective population size. Corresponding data for human 
chromosomes (plotted as stars) suggest that they could be close to critical 
conditions assuming that humans evolved in populations of effective size of 
hundreds of people. 
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3.7 Consequences of complementarity 

Computer simulations have shown that critical parameters for emergence of 
complementarity are recombination frequency, effective population size and 
coding density of chromosomes. The history of evolution of our species and 
human genome parameters suggest that the consequences of complementar- 
ity should be seen at least in some regions of human chromosomes. Inside 
the densely packed chromosomes, like 19, some tendency to grouping genes 
into clusters is observed [9j . Positions of these clusters should correlate with 
so-called recombination deserts. Recombination hot spots and deserts seem 
to be non-randomly distributed on human chromosomes [1^ |35] |2^ US] . Is 
it possible to predict their appearance on the basis of computer simulation 
results? 

3.8 Distribution of recombination events along chromosomes 
and gene clustering 

In all simulations of the role of recombination frequency in the genome evo- 
lution, it has been declared that the recombination events are distributed 
evenly. In the course of simulation, when the distribution of recombinations 
were analysed, only those events were counted which lead to the gametes 
producing surviving zygotes. Haplotypes of these gametes were represented 
in the analysed individuals. We called them "accepted recombination events". 
Usually, when analysing recombinations in the living systems, also only those 
accepted events are counted. It has been noticed that the distribution of 
accepted recombination events in the genomes of simulated populations de- 
pends on parameters of simulations. If evolution is studied in small effec- 
tive populations under relatively low recombination rate, the central parts of 
chromosomes start to form clusters of genes where recombinations have dele- 
terious effect on reproduction potential |l6]. Gametes which are produced 
by recombinations in these regions have lower chance to produce surviving 
zygotes. As a result, the recombination events in gametes which succeed 
in forming the surviving individuals have a characteristic distribution - a 
higher recombination frequency in the regions close to the ends of chromo- 
somes (subtelomeric regions) and a lower recombination rate in the central 
parts of chromosomes [6] . This distribution resembles the distributions ob- 
served for example in all human chromosomes, except of Y chromosome [8] 



|17) |19) |29| |33| |36) . In Fig 14 the distributions of recombinations along 



the virtual and real chromosomes are plotted. As has been mentioned before. 



the effects shown in Fig 14 apply for accepted recombinations. If one con- 
siders the problem from the point of evolutionary costs, it seems to be more 
effective to avoid the recombination in some regions rather than to elimi- 
nate recombinants from the population. It has been shown in simulations 
that the distributions of recombination spots are under selection pressure 
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Figure 14: Detrended cumulative plots of recombination rate distribution 
for human 13 chromosome and virtual chromosome. Increasing regions (sub- 
telomeric) indicate higher than average recombination rate while decreasing 
region (central part) shows the lower than average recombination rate. 



and can evolve |46) |20] . To find this, simulations start with recombinations 
distributed evenly. Frequencies of recombinations at given positions are in- 
herited, though they can be slightly changed from one generation to the 
other. As a result, the distribution of recombination changes into the char- 
acteristic pattern - higher recombination rate in the subtelomeric regions 
and lower in the central part, simultaneously in the central parts clusters 
of complementing genes emerge which can be observed as growing fraction 



of heterozygous loci in the simulated genomes |50) (Fig 15 ) or high linkage 
disequilibrium in natural chromosomes [36] . There is one very important im- 
plication of such an evolution of chromosomal structure - the distribution of 
recombination events seems to be optimized during evolution. Nevertheless, 
there are at least two equivalent parameters shaping this structure: recom- 
bination frequency and effective population size. It has been shown that the 
recombination frequency and its distribution along chromosomes could be an 
effect of long-term genome evolution while the effective population size can 
change much faster and could be even considered as an environmental condi- 
tion. Thus, one can expect that an optimum in the effective population size 
exists for the reproduction potential. For larger and smaller effective popu- 
lations the reproduction potential should decrease. In fact, it has been found 
that such an optimum exists for human population [14] . This phenomenon 
cannot be explained assuming the properties of Mendelian populations. 
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Figure 15: Distribution of heterozygous loci along the virtual chromosomes 
for recombination rate close to critical one. Compare with plots in Fig. [14] 
In the central part of chromosome where recombination rate is relatively low, 
the fraction of heterozygous loci (complementation) is high. 
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3.9 Sympatric speciation 

Sympatric speciation is a phenomenon of a new species emerging inside the 
population of an existing species without any physical, geographical or bi- 
ological barriers. For a long time it has been considered as a negligible 
phenomenon in the biological evolution (24!| . Recently it is still debatable 
|18) though, there are more and more known examples of such an evolu- 
tion and sympatric speciation attracts more attention [I] [3]. In the large 
Mendelian, homozygous populations sympatric speciation seems to be really 
impossible but if one considers a spatial distribution of individuals, simple 
physical distances between them can introduce some restrictions into the 
reproduction process - i.e. higher probability of finding a sexual partner in 
the neighbourhood rather than at long distances. Simulations on lattices 
with declared maximum distances for looking for partners and locating the 
offspring enable the emergence of heterogeneity of populations. In such pop- 
ulations the genetic relations between individuals are negatively correlated 
with distances between them. On the other hand the probability of mating 
between two individuals decreases with the distance. It is possible to follow 
(or almost directly observe) the evolution of individual genomes on lattice 
|44| |46) . Let assume that to each number from the range 1 to 2^'^ a specific 
colour is ascribed and a 24 bits long fragment of a bitstring, representing 
one haplotype of a diploid genome is considered as a number in a binary 
numeral system. Thus, a place when an individual is located is coloured 



according to 24 bits fragment of its genome. Fig 16 illustrates the evolution 
of population which started from one pair of ideal individuals and expanded 
onto the whole lattice. Colours of large radiating regions show the territories 
occupied by individuals with the same fragment of haplotype determining 
the colour. In FigjlT] another case is presented. Lattice was wrapped on 
torus and the initial population was composed of individuals with randomly 
set genomes and age. In the course of simulation the whole initial popula- 
tion was divided for subpopulations seen as distinctly coloured territories. 
In both cases, individuals located close to the borders of those distinctly 
coloured regions can interbreed and form hybrids but these hybrids can not 
survive up to their reproduction age - by definition, the individuals occupy- 
ing those differently coloured territories belong to different species. In both 
cases it was complementation which was responsible for speciation. Rela- 
tively low recombination rate and high inbreeding established by distances 
P and O generate the complementing clusters of genes. Since such a com- 
plementation starts in different parts of lattice independently, the structure 
of complementing clusters are different and do not fit to each other. Thus, 
emerging subpopulations with different clusters behave like the new differ- 
ent species. Mechanisms of complementing strategies described in previous 
sections show that complementation starts in the central parts of chromo- 
somes. Lateral fragments could be under purifying selection. It could be also 
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directly observed in the results of simulation. The subpopulation inside the 



square shown in Fig, 16 seems to be very homogenous according to the central 
fragment of individuals' genomes (Fig]18|panel A). But if the same fragment 
of population is coloured according to the lateral parts of chromosomes, then 



the populations seems to be very polymorphic (Fig, 18 panel B). According 
to this observation, central parts of chromosomes should be rather conserva- 
tive and responsible for belonging to higher order taxons while in the lateral 
parts, less conservative genes should be located. In the real human chro- 
mosomes, the fraction of genes which have their homologous counterparts in 
the mouse genome is lower in subtelomeric regions than in the central parts 
[6] . There is another interesting feature of sympatric speciation observed in 
virtual populations on lattices. As it has been noted, speciation is facili- 
tated when intragenomic recombination is low and inbreeding is high (small 
effective population). Intragenomic recombination rate is fixed by evolution 
and probably changes slowly as an important feature of natural genomes. 
Inbreeding can change faster even following the environmental changes. If 
the fatal environmental disaster reduces the population on a given territory, 
the inbreeding is growing and speciation can occur like during the Permian. 
It has been also shown that sympatric speciation is faster in a changing 
environment even if the whole environment is changed uniformly 



3.10 Gamete recognition 

Complementation strategy assumes that two different (complementing) se- 
quences of alleles fit to each other producing a better fitted diploid genome. 
Assuming that defective alleles have recessive lethal characters, zygotes con- 
taining two identical clusters of genes should die and only those containing 
complementing clusters would survive. If we consider a set of chromosomes 
with only one pair of complementing clusters then the probability of zygotic 
death is 0.5. Evolutionary costs and reproduction would be more efficient if 
gametes could recognize which of them has an identical cluster and which 
one has a complementing cluster of genes before fusing to form a zygote |7] . 
Such systems of recognition or probing the information inside another cell are 
known even in prokaryotes - i.e. an entry exclusion system which prevents 
bacteria to engage in conjugational process if a partner cell already possesses 
genetic information to be transferred |26] |3^ . It is suggested that in humans 
the Major Histocompatibility Complex (MHC) can play such a role in prese- 
lection of mating partners or in the very early immune reaction of mother on 
embryo antigens | il5j | i31j . The MHC complex alone is not enough to guar- 
antee the fusion of complementing haplotypes. If complementing clusters 
can appear on many chromosome pairs - all these pairs should possess an 
operating recognition system. The mechanism should be located at the level 
of gametes and, to be efficient, it should be independent for different pairs of 
chromosomes nestling the complementing clusters of genes. There is a group 
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Figure 16: Speciation effect during tlie expansion of population on a square 
lattice. Population starts from one ideal pair of ancestors (Adam and Eve). 
Regions of different colours are occupied by different species. They are al- 
lowed to hybridize in the border regions but hybrids can not survive because 
of genetic reasons (haplotypes are not complementing) . 
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Figure 17: Syinpatric speciation on square lattice wrapped on torus. Initial 
population was randomly generated (age and genetic structure). The rest 
parameters like in Fig. [16] 
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Figure 18: Fragment of population marked in Fig. 16 Left panel coloured 



according to central part of chromosomes, right panel coloured according to 
lateral parts of chromosomes. Central parts seems to be more homogenous, 
conservative responsible for speciation effect and characteristic for a given 
species while the lateral parts are more polymorphic responsible for different 
individual traits inside species. 



of genes which could fulfil such a role - Olfactory Receptor genes (OR) 
This is the largest gene family in the human genome composed of almost 
1000 genes, clustered in many different groups located on almost all chromo- 
somes (excluding 20 and Y) and at least some of these genes are expressed 
during spermatogenesis |32] |12) . Thus, OR could play a role in chemotaxis 
of sperm cells enhancing the chance of the best fitted one for fertilization 
the egg 133. If we assume that each of our 22 pairs of autosomes have com- 
plementing clusters of genes, then a recognizing system should increase the 
probability of successful fertilization by several orders of magnitude. On the 
other hand, the existence of such a system explains why a male produces 
such an enormous surplus of sperm cells. 
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